RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 09/ 99L 

Source: 7 _ OM 

Date Processed by STIC: Zfi^/ZoeT^ 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTED 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http;//www.uspto.gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.nspto.gov/ebc/efs/downloads/documents.htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7* Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202 
Or 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
201 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 




Revised 01/29/2002 



/ 

Raw Sequence listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 0 9 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

1 Wrapped Nucleics The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
Wrapped Aminos was retrieved in a word processor alter creating it Please adjust your right margin to .3; this will 

prevent "wrapping." 

2 Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 

The numbering under each 5* amino acid is misaligned Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved In ASCII text. 

Sequences) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequencers) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences.' 

Sequences) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

Sequences) missing. If Intentional, please insert the following lines for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 
In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 
,» 

Per 1.823 of Sequence Riles, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial-Sequence ■ • • • 

Sequence^) missing the <220> "Feature" and associated numeric identifiers and responses. 

Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register," 06701/1998, Vol. 63, No. 104, pp. 2963 1-32) (Sec 1.823 of Sequence Rules) 

12 Patentln 2.0 Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 

"bug" resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 

listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 

13 Misuse of n n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value not specifically a nucleotide. 




3 1/ Misaligned Amino 
Numbering 



Non- ASCII 



Variable Length 



6 Patentln 2.0 

"bug" 



Skipped Sequences 
(OLD RULES) 



8 Skipped Sequences 

(NEW RULES) 



Use of n's or Xaa's 



(NEW RULES) 



10 Invalid <213> 
Response 



11 Useof<220> 
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OIPE 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/996 , 611A 



DATE: 05/14/2002 
TIME: 15:50:52 



Input Set : A:\SEQUENCE LISTING . txt 
Output Set: N:\CRF3\05142002\I996611A.raw 



Does Not Comply 
Corrects Diskette Weeded 



1 <110> APPLICANT: Chou, Min-Yuan 

2 Leu, Charng-Yih 

4 <120> TITLE OF INVENTION: Novel Human alpha 1 Chain Collagen 
6 <130> FILE REFERENCE: 32350-176844 

8 <140> CURRENT APPLICATION NUMBER: US 09/996, 611A 

9 <141> CURRENT FILING DATE: 2001-11-30 

11 <150> PRIOR APPLICATION NUMBER: TAIWAN 89128027 

12 <151> PRIOR FILING DATE: 2000-12-27 
14 <160> NUMBER OF SEQ ID NOS : 5 

16 <170> SOFTWARE: Microsoft Word 



ff 



ERRORED SEQUENCES 



18 
19 
20 
21 
23 
24 
25 
27 
28 

E--> 29 

30 

E--> 31 

32 

E--> 33 

34 

E--> 35 

36 

E--> 37 

38 

E--> 39 

40 

E--> 41 

42 

E--> 43 

44 

E--> 45 

46 

E--> 47 

48 



<210> SEQ ID NO: 1 

<211> LENGTH: 954 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapiens 

<220> FEATURE: 

<223> OTHER INFORMATION: alpha 1 chain collagen 

Pro in this sequence stands for hydroxyproline 
<400> SEQUENCE: 1 

Met Ala His Tyr Tie Thr Phe Leu Cys Met Val Leu Val Leu Leu 
1 5 10 15~^ > 

Leu Gin A~Sfi Ser vai--Leu Ala Glu Asp Gly Glu Val Arg Ser Ser 
( ^20 25 3 0^ 

C^s Arg Thr Ala Pro^Jhr Asp Leu Val Phe lie Leu Asp Gly Ser 
(^35 40 45 1 

Tyr— S er Val Gly P ro Glu Asiv Phe Glu lie Val Lys Lys Trp Leu 
Cj50 55 60 J 

falAsn lie Thr Lys ASh _ Phe _ A , fep lie Gly Pro Lys Phe lie Gin 

65 70 75 ) 

aT~CTy- Val Val Gin Tyr -SgfAsfe Tyr Pro Val Leu Glu lie Pro 
V40^ _ 85 3jL ^^ 

Leu Gly" Ser Tyr Asp tief/Gly Glu His Leu Thr Ala Ala Val Glu 
< VJJ5 100^105, 

Ser~~lle Leu Tyf~~Leu Gly CjAy Asn Thr Lys Thr Gly Lys Ala lie 
V110 115 120 

,Gln PKe Ala Leu Asp Tyr Le"u Phe Ala Lys Ser Ser Arg Phe Leu 
V125 130 135 

^ThrEys— J-1 e ftlri V-alr-Val) Leu Thr Asp Gly Lys Ser Gin Asp Asp 
^L40 145 150 

Val Lys~~ Asp Ala Ar 3"Gln Ala Ala Arg Asp Ser Lys lie Thr Leu 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/996, 611A 



DATE: 05/14/2002 
TIME: 15:50:52 



Input Set : A:\SEQUENCE LISTING .txt 
Output Set: N:\CRF3\05142002\I996611A.raw 



E- 


- > 


49 






50 


E- 


-> 


51 






52 


E- 


-> 


53 






54 


E- 


- > 


55 






56 


E- 


-> 


57 






58 


E- 


-> 


59 






60 


E- 


-> 


61 






62 


E- 


-> 


63 






64 


E- 


-> 


65 






66 


E- 


-> 


67 






68 


E- 


-> 


69 






70 


E- 


-> 


71 






72 


E- 


-> 


73 






74 


E- 


-> 


75 






76 


E- 


-> 


77 






78 


E- 


-> 


79 






80 


E- 


-> 


81 






82 


E- 


-> 


83 






84 


E- 


-> 


85 






86 


E- 


-> 


87 






88 


E- 


-> 


89 






90 


E- 


-> 


91 






92 


E- 


-> 


93 






94 


E- 


-> 


95 






96 


E- 


-> 


97 



49 ^155 160 

50 Phe~-Ate- lie Gly - 
175 180 




70 Val Asp Lys lie Leu Leu Ph 
" 320 325 33 



340 34 



e_jrhr 



355 



76 Val Thr Leu Tyfile Asp As 

77 Q&5. 370 375 



lis Pi 
C380 



385 



r$ Tyr Ser Gly Lys u±u 
-3-9.5 400 40_5->' 



390 ^ 



82 Leu Arg lie Tyr Cys As 
C410 415 42C 

^yTs~GTtr-a o Pro Gly - 

C425 430 

Thr Pro Ala Pro Cys lie 
4-40___ 445 450 



3 Asp~P*.o 
435___--^ 



6 Ser 

7 ^4-4 



in Gly Pro Lys Giy Asp -Pi 
^t^J^ 460 465 \y 



475 



480 



Ai^a G 




490 495 



505 



Jly TLyi 
_5JJK 



97 C415 



520 



525 y 



Thr 


Glu 


Asp 


Ala 


Glu 


Leu 


Arg 


Thr 


Tyr 


Val 


Phe 


Tyr 


Val 


Glu 


Arg 


Glu 


Val 


Met 


Lys 


Gin 


Lys 


Thr 


Arg 


He 


Pro 


Val 


Ala 


Ala 


Leu 


Leu 


Gly 


Leu 


Asp 


Val 


Asn 


Leu 


Ser 


Pro 


Lys 


Lys 


He 


Lys 


Asp 


Leu 


Ser 


Glu 


Leu 


Thr 


Ser 


Pro 


Ser 


Tyr 


Val 


Phe 


Val 


Ser 


lie 


Trp 


Asp 


Leu 


Trp 


Arg 


He 


He 


Ala 


Val 


Thr 


Leu 


Asn 


Gly 


Thr 


Thr 


Ser 


Val 


He 


Asn 


Gly 


Pro 


Gin 


Val 


Lys 


Thr 


Leu 


Phe 


Leu 


Leu 


Val 


Thr 


Glu 


Gin 


Asp 


Gin 


He 


Glu 


Asn 


Lys 


Pro 


Leu 


Asn 


Gly 


Gin 


Thr 


Gin 


He 


Gly 


Val 


Gin 


Phe 


Asp 


Val 


Gin 


Lys 


Gin 


Asn 


Asn 


Arg 


Glu 


Thr 


Ala 


Asn 


Gly 


Pro 


Ser 


Asp 


Val 


Glv 


Pro 


Pro 


Gly 


Lys 


Pro 


Gly 


Leu 


Leu 


Pro 


Gly 


Asn 


Pro 


Gly 


Tyr 


Lys 


Pro 


Gly 


Tyr 


Gin 


Gly 


He 


Ser 


Pro 


Gly 


He 


Gin 


Gly 


Ala 


Glu 


Pro 


Gly 


Arg 


Asp 


Gly 


Asp 


Phe 


Pro 


Gly 


Leu 


His 


Gly 


Met 
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RAW SEQUENCE LISTING DATE : 05/14/2002 

PATENT APPLICATION: US/09/996, 611A TIME: 15:50:52 

Input Set : A:\SEQUENCE LISTING.txt 
Output Set: N:\CRF3\05142002\I996611A.raw 

98 Pro Gly Ser Lys Gly Glu Met Gly Ala Lys Gly Asp Lys Gly Ser 
E--> 99 530 535 540 

100 Pro Gly Phe Tyr Gly Lys Lys Gly Ala Lys Gly Glu Lys Gly Asn 

E--> 101 545 550 555 

102 Ala Gly Phe Pro Gly Leu Pro Gly Pro Ala Gly Glu Pro Gly Arg 

E--> 103 560 565 570 

104 His Gly Lys Asp Gly Leu Met Gly Ser Pro Gly Phe Lys Gly Glu 

E--> 105 575 580 585 

106 Ala Gly Ser Pro Gly Ala Pro Gly Gin Asp Gly Thr Arg Gly Glu 

E--> 107 590 595 600 

108 Pro Gly He Pro Gly Phe Pro Gly Asn Arg Gly Leu Met Gly Gin 

E--> 109 605 610 615 - ji 0 

110 Lys Gly Glu lie Gly Pro Pro Gly Gin Gin Gly Lys Lys Gly Ala S)s&*7^> 

E--> 111 620 625 630 * 

112 Pro Gly Met Pro Gly Leu Met Gly Ser Asn Gly Ser Pro Gly Gin 

E--> 113 635 640 645 

114 Pro Gly Thr Pro Gly Ser Lys Gly Ser Lys Gly Glu Pro Gly He 

E--> 115 650 655 660 

116 Gin Gly Met Pro Gly Ala Ser Gly Leu Lys Gly Glu Pro Gly Ala 

E--> 117 665 670 675 

118 Thr Gly Ser Pro Gly Glu Pro Gly Tyr Met Gly Leu Pro Gly He 

E--> 119 680 685 690 

120 Gin Gly Lys Lys Gly Asp Lys Gly Asn Gin Gly Glu Lys Gly He 
E--> 121 695 700 705 

122 Gin Gly Gin Lys Gly Glu Asn Gly Arg Gin Gly He Pro Gly Gin 
E--> 123 710 715 720 

124 Gin Gly He Gin Gly His His Gly Ala Lys Gly Glu Arg Gly Glu 

E--> 125 725 730 735 

126 Lys Gly Glu Pro Gly Val Arg Gly Ala He Gly Ser Lys Gly Glu 
E--> 127 740 745 750 

128 Ser Gly Val Asp Gly Leu Met Gly Pro Ala Gly Pro Lys Gly Gin 
E--> 129 755 760 765 

130 Pro Gly Asp Pro Gly Pro Gin Gly Pro Pro Gly Leu Asp Gly Lys 

E--> 131 770 775 780 

132 Pro Gly Arg Glu Phe Ser Glu Gin Phe He Arg Gin Val Cys Thr 

E--> 133 785 790 795 

134 Asp Val He Arg Ala Gin Leu Pro Val Leu Leu Gin Ser Gly Arg 

E--> 135 800 805 810 

136 He Arg Asn Cys Asp His Cys Leu Ser Gin His Gly Ser Pro Gly 

E--> 137 815 820 825 

138 He Pro Gly Pro Pro Gly Pro He Gly Pro Glu Gly Pro Arg Gly 

E--> 139 830 835 840 

140 Leu Pro Gly Leu Pro Gly Arg Asp Gly Val Pro Gly Leu Val Gly 
E--> 141 845 850 855 

142 Val Pro Gly Arg Pro Gly Val Arg Gly Leu Lys Gly Leu Pro Gly 
E--> 143 860 865 870 

144 Arg Asn Gly Glu Lys Gly Ser Gin Gly Phe Gly Tyr Pro Gly Glu 
E--> 145 875 880 885 

146 Gin Gly Pro Pro Gly Pro Pro Gly Pro Glu Gly Pro Pro Gly He 



file://C:\Crf3\Outhold\VsrI9966 1 1 A.htm 



5/14/02 



Page 4 of 9 




RAW SEQUENCE LISTING DATE: 05/14/2002 

PATENT APPLICATION: US/09/996 , 611A TIME: 15:50:52 

Input Set : A:\SEQUENCE LISTING.txt 
Output Set: N:\CRF3\05142002\I996611A.raw 

E--> 147 890 895 900 

148 Ser Lys Glu Gly Pro Pro Gly Asp Pro Gly Leu Pro Gly Lys Asp 
E--> 149 905 910 915 

150 Gly Asp His Gly Lys Pro Gly lie Gin Gly Gin Pro Gly Pro Pro 
E--> 151 920 925 930 

152 Gly lie Cys Asp Pro Ser Leu Cys Phe Ser Val lie Ala Arg Arg 
E--> 153 935 940 945 

154 Asp Pro Phe Arg Lys Gly Pro Asn Tyr 
E--> 155 950 954 

157 <210> SEQ ID NO: 2 

158 <211> LENGTH: 171 

159 <212> TYPE: PRT 

160 <213> ORGANISM: Homo sapiens 

162 <220> FEATURE: 

163 <223> OTHER INFORMATION: von Willebrand factor A domain 

165 <400> SEQUENCE: 2 

166 Asp Leu Val Phe He Leu Asp Gly Ser Tyr Ser Val Gly Pro Glu 
E--> 167 1 5 10 15 

168 Asn Phe Glu He Val Lys Lys Trp Leu Val Asn He Thr Lys Asn 
E--> 169 20 25 30 

170 Phe Asp He Gly Pro Lys Phe He Gin Val Gly Val Val Gin Tyr 
E--> 171 35 40 45 

172 Ser Asp Tyr Pro Val Leu Glu He Pro Leu Gly Ser Tyr Asp Ser 

E--> 173 50 55 60 />JLsV>-& 

174 Gly Glu His Leu Thr Ala Ala Val Glu Ser He Leu Tyr Leu Gly ^/> /v ^ 
E--> 175 65 70 75 

17 6 Gly Asn Thr Lys Thr Gly Lys Ala He Gin Phe Ala Leu Asp Tyr 
E--> 177 80 85 90 

178 Leu Phe Ala Lys Ser Ser Arg Phe Leu Thr Lys He Ala Val Val 
E--> 179 95 100 105 

180 Leu Thr Asp Gly Lys Ser Gin Asp Asp Val Lys Asp Ala Ala Gin 
E--> 181 110 115 120 

182 Ala Ala Arg Asp Ser Lys He Thr Leu Phe Ala He Gly Val Gly 
E--> 183 125 130 135 

184 Ser Glu Thr Glu Asp Ala Glu Leu Arg Ala He Ala Asn Lys Pro 
E--> 185 140 145 150 

186 Ser Ser Thr Tyr Val Phe Tyr Val Glu Asp Tyr He Ala He Ser 
E--> 187 155 160 165 

188 Lys He Arg Glu Val Met 
E--> 189 170 171 

192 <210> SEQ ID NO: 3 

193 <211> LENGTH: 183 

194 <212> TYPE: PRT 

195 <213> ORGANISM: Homo sapiens 

197 <220> FEATURE: 

198 <223> OTHER INFORMATION: Thrombospondin N- terminal- like domain 

200 <400> SEQUENCE: 3 

201 Gly Phe Asp He Leu Leu Gly Leu Asp Val Asn Lys Lys Val Lys 
E--> 202 15 10 15 
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RAW SEQUENCE LISTING DATE: 05/14/2002 

PATENT APPLICATION: US/09/996 , 611A TIME: 15:50:52 

Input Set : A:\SEQUENCE LISTING . txt 
Output Set: N:\CEF3\05142002\I996611A.raw 

203 Lys Arg lie Gin Leu Ser Pro Lys Lys lie Lys Gly Tyr Glu Val 

E--> 204 20 25 30 

205 Thr Ser Lys Val Asp Leu Ser Glu Leu Thr Ser Asn Val Phe Pro 

E--> 206 35 40 45 

207 Glu Gly Leu Pro Pro Ser Tyr Val Phe Val Ser Thr Gin Arg Phe 

E--> 208 50 55 60 

209 Lys Val Lys Lys lie Trp Asp Leu Trp Arg lie Leu Thr lie Asp 

E--> 210 65 70 75 

211 Gly Arg Pro Gin lie Ala Val Thr Leu Asn Gly Val Asp Lys lie 

E--> 212 80 85 90 

213 Leu Leu Phe Thr Thr Thr Ser Val lie Asn Gly Ser Gin Val Val 

E--> 214 95 100 105 

215 Thr Phe Ala Asn Pro Gin Val Lys Thr Leu Phe Asp Glu Gly Trp 

E--> 216 110 115 120 

217 His Gin lie Arg Leu Leu Val Thr Glu Gin Asp Val Thr Leu Tyr 

E--> 218 125 130 135 

219 lie Asp Asp Gin Gin lie Glu Asn Lys Pro Leu His Pro Val Leu 

E--> 220 140 145 150 

221 Gly lie Leu lie Asn Gly Gin Thr Gin lie Gly Lys Tyr Ser Gly 

E--> 222 155 160 165 

223 Lys Glu Glu Thr Val Gin Phe Asp Val Gin Lys Leu Arg lie Tyr 

E--> 224 170 175 180 

22 5 Cys Asp Pro 

E--> 226 183 

229 <210> SEQ ID NO: 4 

230 <211> LENGTH: 509 

231 <212> TYPE: PRT 

232 <213> ORGANISM: Homo sapiens 

234 <220> FEATURE: 

235 <223> OTHER INFORMATION: collagenous domain 

236 Pro in this sequence stands for hydroxyproline 
238 <400> SEQUENCE: 4 





239 


Gly Lys 


Pro Gly Leu 


Gin Gly 


Pro 


Lys 


Gly 


Asp 


Pro 


Gly 


Leu 


Pro 


E--> 


240 


1 5 


10 15 






















241 


Gly Asn 


Pro Gly Tyr 


Pro Gly 


Gin 


Pro 


Gly 


Gin 


Asp 


Gly 


Lys 


Pro 


E--> 


242 


20 


25 30 






















243 


Gly Tyr 


Gin Gly lie 


Ala Gly 


Thr 


Pro 


Gly 


Val 


Pro 


Gly 


Ser 


Pro 


E--> 


244 


35 


40 


45 




















245 


Gly lie 


Gin Gly Ala 


Arg Gly 


Leu 


Pro 


Gly 


Tyr 


Lys 


Gly 


Glu 


Pro 


E--> 


246 


50 


55 60 






















247 


Gly Arg 


Asp Gly Asp 


Lys Gly 


Asp 


Arg 


Gly 


Leu 


Pro 


Gly 


Phe 


Pro 


E--> 


248 


65 


70 75 






















249 


Gly Leu 


His Gly Met 


Pro Gly 


Ser 


Lys 


Gly 


Glu 


Met 


Gly 


Ala 


Lys 


E--> 


250 


80 


85 90 






















251 


Gly Asp 


Lys Gly Ser 


Pro Gly 


Phe 


Tyr 


Gly 


Lys 


Lys 


Gly 


Ala 


Lys 


E--> 


252 


95 


100 105 






















253 


Gly Glu 


Lys Gly Asn 


Ala Gly 


Phe 


Pro 


Gly 


Leu 


Pro 


Gly 


Pro 


Ala 


E--> 


254 


110 


115 120 




















255 


Gly Glu 


Pro Gly Arg 


His Gly 


Lys 


Asp 


Gly 


Leu 


Met 


Gly 


Ser 


Pro 
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RAW SEQUENCE LISTING DATE: 05/14/2002 

PATENT APPLICATION: US/09/996, 611A TIME: 15:50:52 

Input Set : A:\SEQUENCE LISTING . txt 
Output Set: N:\CRF3\05142002\I996611A.raw 



E- 


-> 


256 

257 


125 

Gly Phe 


130 

Lys Gly 


135 
Glu Ala 


Gly 


Ser 


Pro 


Gly 


Ala 


Pro Gly Gin 


Asp 


E- 


-> 


258 

259 


140 

Gly Thr 


145 

Arg Gly 


150 

Glu Pro Gly 


He 


Pro 


Gly 


Phe 


Pro 


Gly 


Asn 


Arg 


E- 


-> 


260 

261 


155 

Gly Leu 


160 

Met Gly 


165 

Gin Lys Gly Glu 


He 


Gly 


Pro 


Pro 


Gly 


Gin 


Gin 


E- 


-> 


262 

263 


170 

Gly Lys 


175 

Lys Gly 


180 

Ala Pro Gly Met 


Pro 


Gly 


Leu 


Met 


Gly 


Ser 


Asn 


E- 


-> 


264 

265 


185 

Gly Ser 


190 

Pro Gly 


195 

Gin Pro 


Gly 


Thr 


Pro Gly 


Ser 


Lys 


Gly 


Ser 


Lys 


E- 


-> 


266 

267 


200 

Gly Glu 


205 

Pro Gly 


He Gin 


210 

Gly Met 


Pro 


Gly 


Ala 


Ser 


Gly 


Leu 


Lys 


E- 


-> 


268 

269 


215 

Gly Glu 


220 

Pro Gly 


Ala Thr 


225 

Gly 


Ser 


Pro 


Gly 


Glu 


Pro 


Gly 


Tyr 


Met 


E- 


-> 


270 

271 


230 

Gly Leu 


235 

Pro Gly 


240 

He Gin 


Gly 


Lys 


Lys 


Gly 


Asp 


Lys 


Gly 


Asn 


Gin 


E- 


-> 


272 

273 


245 

Gly Glu 


250 

Lys Gly 


255 

He Gin Gly Gin 


Lys 


Gly 


Glu 


Asn 


Gly 


Arg 


Gin 


E- 


-> 


274 

275 


260 

Gly He 


265 

Pro Gly 


270 

Gin Gin 


Gly 


He 


Gin 


Gly 


His 


His 


Gly 


Ala 


Lys 


E- 


-> 


276 

277 


275 280 

Gly Glu Arg Gly 


Glu Lys 


285 

Gly 


Glu 


Pro Gly Val 


Arg 


Gly 


Ala 


He 


E- 


-> 


278 

279 


290 

Gly Ser 


295 

Lys Gly 


300 

Glu Ser Gly Val 


Asp 


Gly 


Leu 


Met 


Gly 


Pro 


Ala 


E- 


-> 


280 

281 


305 

Gly Pro 


310 

Lys Gly 


315 

Gin Pro 


Gly Asp 


Pro 


Gly 


Pro 


Gin 


Gly 


Pro 


Pro 


E- 


-> 


282 

283 


320 

Gly Leu 


325 

Asp Gly 


330 

Lys Pro Gly Arg 


Glu 


Phe 


Ser 


Glu 


Gin 


Phe 


He 


E- 


-> 


284 

285 


335 

Arg Gin 


340 

Val Cys 


345 

Thr Asp 


Val 


He 


Arg 


Ala 


Gin 


Leu 


Pro 


Val 


Leu 


E- 


-> 


286 

287 


350 

Leu Gin 


355 

Ser Gly 


360 

Arg He 


Arg 


Asn 


Cys 


Asp 


His 


Cys 


Leu 


Ser 


Gin 


E- 


-> 


288 

289 


365 

His Gly 


370 

Ser Pro 


375 

Gly He 


Pro Gly 


Pro 


Pro 


Gly 


Pro 


He 


Gly 


Pro 


E- 


-> 


290 

291 


380 

Glu Gly 


385 

Pro Arg 


Gly Leu 


390 

Pro 


Gly 


Leu 


Pro 


Gly 


Arg 


Asp 


Gly 


Val 


E- 


-> 


292 

293 


395 

Pro Gly 


400 

Leu Val 


405 

Gly Val 


Pro 


Gly Arg 


Pro 


Gly 


Val 


Arg 


Gly 


Leu 


E- 


-> 


294 

295 


410 

Lys Gly 


415 

Leu Pro 


420 

Gly Arg 


Asn 


Gly 


Glu 


Lys 


Gly 


Ser 


Gin 


Gly 


Phe 


E- 


-> 


296 

297 


425 

Gly Tyr 


430 

Pro Gly 


435 

Glu Gin 


Gly 


Pro 


Pro 


Gly 


Pro 


Pro 


Gly 


Pro 


Glu 


E- 


-> 


298 

299 


440 

Gly Pro 


445 

Pro Gly 


450 

He Ser 


Lys 


Glu 


Gly 


Pro 


Pro 


Gly 


Asp 


Pro 


Gly 


E- 


-> 


300 

301 


455 460 

Leu Pro Gly Lys 


Asp Gly 


465 

Asp 


His 


Gly 


Lys 


Pro 


Gly 


He 


Gin 


Gly 


E- 


-> 


302 

303 


470 

Gin Pro 


475 

Gly Pro 


480 

Pro Gly 


He 


Cys 


Asp 


Pro 


Ser 


Leu 


Cys 


Phe 


Ser 


E- 


-> 


304 


485 


490 




495 
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RAW SEQUENCE LISTING DATE : 05/14/2002 

PATENT APPLICATION: US/09/996 , 611A TIME: 15:50:52 

Input Set : A:\SEQUENCE LISTING . txt 
Output Set: N:\CRF3\05142002\I996611A.raw 

305 Val He Ala Arg Arg Asp Pro Phe Arg Lys Gly Pro Asn Tyr 
E--> 306 500 505 509 
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VERIFICATION SUMMARY DATE: 05/14/2002 

PATENT APPLICATION: US/09/996, 611A TIME: 15:50:53 

Input Set : A:\SEQUENCE LISTING . txt 
Output Set: N:\CRF3\05142002\I996611A.raw 

L:29 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:1 
M:332 Repeated in SeqNo=l 

L:167 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:2 
M:332 Repeated in SeqNo=2 

L:202 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 3 
M:332 Repeated in SeqNo=3 

L:240 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 4 
M:332 Repeated in SeqNo=4 
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